Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 20 de 37
1.
Genome Biol ; 25(1): 103, 2024 Apr 19.
Article En | MEDLINE | ID: mdl-38641849

Spatially resolved transcriptomics technologies have opened new avenues for understanding gene expression heterogeneity in spatial contexts. However, existing methods for identifying spatially variable genes often focus solely on statistical significance, limiting their ability to capture continuous expression patterns and integrate spot-level covariates. To address these challenges, we introduce spVC, a statistical method based on a generalized Poisson model. spVC seamlessly integrates constant and spatially varying effects of covariates, facilitating comprehensive exploration of gene expression variability and enhancing interpretability. Simulation and real data applications confirm spVC's accuracy in these tasks, highlighting its versatility in spatial transcriptomics analysis.


Gene Expression Profiling , Transcriptome , Computer Simulation , Spatial Analysis , Gene Expression
2.
Article En | MEDLINE | ID: mdl-38641207

BACKGROUND & AIMS: Lacticaseibacillus rhamnosus GG (LGG) is the world's most consumed probiotic but its mechanism of action on intestinal permeability and differentiation along with its interactions with an essential source of signaling metabolites, dietary tryptophan (trp), are unclear. METHODS: Untargeted metabolomic and transcriptomic analyses were performed in LGG monocolonized germ-free mice fed trp-free or -sufficient diets. LGG-derived metabolites were profiled in vitro under anaerobic and aerobic conditions. Multiomic correlations using a newly developed algorithm discovered novel metabolites tightly linked to tight junction and cell differentiation genes whose abundances were regulated by LGG and dietary trp. Barrier-modulation by these metabolites were functionally tested in Caco2 cells, mouse enteroids, and dextran sulfate sodium experimental colitis. The contribution of these metabolites to barrier protection is delineated at specific tight junction proteins and enterocyte-promoting factors with gain and loss of function approaches. RESULTS: LGG, strictly with dietary trp, promotes the enterocyte program and expression of tight junction genes, particularly Ocln. Functional evaluations of fecal and serum metabolites synergistically stimulated by LGG and trp revealed a novel vitamin B3 metabolism pathway, with methylnicotinamide (MNA) unexpectedly being the most robust barrier-protective metabolite in vitro and in vivo. Reduced serum MNA is significantly associated with increased disease activity in patients with inflammatory bowel disease. Exogenous MNA enhances gut barrier in homeostasis and robustly promotes colonic healing in dextran sulfate sodium colitis. MNA is sufficient to promote intestinal epithelial Ocln and RNF43, a master inhibitor of Wnt. Blocking trp or vitamin B3 absorption abolishes barrier recovery in vivo. CONCLUSIONS: Our study uncovers a novel LGG-regulated dietary trp-dependent production of MNA that protects the gut barrier against colitis.

3.
Infect Immun ; 92(3): e0053923, 2024 Mar 12.
Article En | MEDLINE | ID: mdl-38299827

The obligate intracellular bacterium Chlamydia has a unique developmental cycle that alternates between two contrasting cell types. With a hardy envelope and highly condensed genome, the small elementary body (EB) maintains limited metabolic activities yet survives in extracellular environments and is infectious. After entering host cells, EBs differentiate into larger and proliferating reticulate bodies (RBs). Progeny EBs are derived from RBs in late developmental stages and eventually exit host cells. How expression of the chlamydial genome consisting of nearly 1,000 genes governs the chlamydial developmental cycle is unclear. A previous microarray study identified only 29 Chlamydia trachomatis immediate early genes, defined as genes with increased expression during the first hour postinoculation in cultured cells. In this study, we performed more sensitive RNA sequencing (RNA-Seq) analysis for C. trachomatis cultures with high multiplicities of infection. Remarkably, we observed well over 700 C. trachomatis genes that underwent 2- to 900-fold activation within 1 hour postinoculation. Quantitative reverse transcription real-time PCR analysis was further used to validate the activated expression of a large subset of the genes identified by RNA-Seq. Importantly, our results demonstrate that the immediate early transcriptome is over 20 times more extensive than previously realized. Gene ontology analysis indicates that the activated expression spans all functional categories. We conclude that over 70% of C. trachomatis genes are activated in EBs almost immediately upon entry into host cells, thus implicating their importance in initiating rapid differentiation into RBs and establishing an intracellular niche conducive with chlamydial development and growth.


Chlamydia Infections , Chlamydia trachomatis , Humans , Cells, Cultured , Base Sequence , Transcriptome , Real-Time Polymerase Chain Reaction , Chlamydia Infections/genetics
4.
Lab Invest ; 104(4): 100330, 2024 Apr.
Article En | MEDLINE | ID: mdl-38242234

Intestinal microbiota confers susceptibility to diet-induced obesity, yet many probiotic species that synthesize tryptophan (trp) actually attenuate this effect, although the underlying mechanisms are unclear. We monocolonized germ-free mice with a widely consumed probiotic Lacticaseibacillus rhamnosus GG (LGG) under trp-free or -sufficient dietary conditions. We obtained untargeted metabolomics from the mouse feces and serum using liquid chromatography-mass spectrometry and obtained intestinal transcriptomic profiles via bulk-RNA sequencing. When comparing LGG-monocolonized mice with germ-free mice, we found a synergy between LGG and dietary trp in markedly promoting the transcriptome of fatty acid metabolism and ß-oxidation. Upregulation was specific and was not observed in transcriptomes of trp-fed conventional mice and mice monocolonized with Ruminococcus gnavus. Metabolomics showed that fecal and serum metabolites were also modified by LGG-host-trp interaction. We developed an R-Script-based MEtabolome-TRanscriptome Correlation Analysis algorithm and uncovered LGG- and trp-dependent metabolites that were positively or negatively correlated with fatty acid metabolism and ß-oxidation gene networks. This high-throughput metabolome-transcriptome correlation strategy can be used in similar investigations to reveal potential interactions between specific metabolites and functional or disease-related transcriptomic networks.


Gastrointestinal Microbiome , Lacticaseibacillus rhamnosus , Mice , Animals , Intestines , Gastrointestinal Microbiome/genetics , Gene Expression Profiling , Fatty Acids
5.
NAR Genom Bioinform ; 6(1): lqae004, 2024 Mar.
Article En | MEDLINE | ID: mdl-38288376

Analyzing single-cell RNA sequencing (scRNA-seq) data remains a challenge due to its high dimensionality, sparsity and technical noise. Recognizing the benefits of dimensionality reduction in simplifying complexity and enhancing the signal-to-noise ratio, we introduce scBiG, a novel graph node embedding method designed for representation learning in scRNA-seq data. scBiG establishes a bipartite graph connecting cells and expressed genes, and then constructs a multilayer graph convolutional network to learn cell and gene embeddings. Through a series of extensive experiments, we demonstrate that scBiG surpasses commonly used dimensionality reduction techniques in various analytical tasks. Downstream tasks encompass unsupervised cell clustering, cell trajectory inference, gene expression reconstruction and gene co-expression analysis. Additionally, scBiG exhibits notable computational efficiency and scalability. In summary, scBiG offers a useful graph neural network framework for representation learning in scRNA-seq data, empowering a diverse array of downstream analyses.

6.
mBio ; 15(1): e0203623, 2024 Jan 16.
Article En | MEDLINE | ID: mdl-38112466

IMPORTANCE: Hallmarks of the developmental cycle of the obligate intracellular pathogenic bacterium Chlamydia are the primary differentiation of the infectious elementary body (EB) into the proliferative reticulate body (RB) and the secondary differentiation of RBs back into EBs. The mechanisms regulating these transitions remain unclear. In this report, we developed an effective novel strategy termed dependence on plasmid-mediated expression (DOPE) that allows for the knockdown of essential genes in Chlamydia. We demonstrate that GrgA, a Chlamydia-specific transcription factor, is essential for the secondary differentiation and optimal growth of RBs. We also show that GrgA, a chromosome-encoded regulatory protein, controls the maintenance of the chlamydial virulence plasmid. Transcriptomic analysis further indicates that GrgA functions as a critical regulator of all three sigma factors that recognize different promoter sets at developmental stages. The DOPE strategy outlined here should provide a valuable tool for future studies examining chlamydial growth, development, and pathogenicity.


Chlamydia Infections , Chlamydia trachomatis , Humans , Chlamydia trachomatis/metabolism , Gene Expression Regulation, Bacterial , Transcription Factors/metabolism , Sigma Factor/genetics , Bacterial Proteins/genetics , Bacterial Proteins/metabolism
7.
J Med Virol ; 95(10): e29187, 2023 10.
Article En | MEDLINE | ID: mdl-37877809

Chronic infection of hepatitis B virus (HBV) is the major cause of hepatocellular carcinoma (HCC). Notably, 90% of HBV-positive HCC cases exhibit detectable HBV integrations, hinting at the potential early entanglement of these viral integrations in tumorigenesis and their subsequent oncogenic implications. Nevertheless, the precise chronology of integration events during HCC tumorigenesis, alongside their sequential structural patterns, has remained elusive thus far. In this study, we applied whole-genome sequencing to multiple biopsies extracted from six HBV-positive HCC cases. Through this approach, we identified point mutations and viral integrations, offering a blueprint for the intricate tumor phylogeny of these samples. The emergent narrative paints a rich tapestry of diverse evolutionary trajectories characterizing the analyzed tumors. We uncovered oncogenic integration events in some samples that appear to happen before and during the initiation stage of tumor development based on their locations in reconstituted trajectories. Furthermore, we conducted additional long-read sequencing of selected samples and unveiled integration-bridged chromosome rearrangements and tandem repeats of the HBV sequence within integrations. In summary, this study revealed premalignant oncogenic and sequential complex integrations and highlighted the contributions of HBV integrations to HCC development and genome instability.


Carcinoma, Hepatocellular , Hepatitis B , Liver Neoplasms , Humans , Hepatitis B virus/genetics , Carcinogenesis , Cell Transformation, Neoplastic
8.
EMBO J ; 42(21): e113975, 2023 11 02.
Article En | MEDLINE | ID: mdl-37718683

Paneth cells (PCs), a specialized secretory cell type in the small intestine, are increasingly recognized as having an essential role in host responses to microbiome and environmental stresses. Whether and how commensal and pathogenic microbes modify PC composition to modulate inflammation remain unclear. Using newly developed PC-reporter mice under conventional and gnotobiotic conditions, we determined PC transcriptomic heterogeneity in response to commensal and invasive microbes at single cell level. Infection expands the pool of CD74+ PCs, whose number correlates with auto or allogeneic inflammatory disease progressions in mice. Similar correlation was found in human inflammatory disease tissues. Infection-stimulated cytokines increase production of reactive oxygen species (ROS) and expression of a PC-specific mucosal pentraxin (Mptx2) in activated PCs. A PC-specific ablation of MyD88 reduced CD74+ PC population, thus ameliorating pathogen-induced systemic disease. A similar phenotype was also observed in mice lacking Mptx2. Thus, infection stimulates expansion of a PC subset that influences disease progression.


Microbiota , Paneth Cells , Humans , Animals , Mice , Paneth Cells/metabolism , Paneth Cells/pathology , Intestine, Small , Inflammation/pathology , Cytokines/metabolism
9.
Bioinformatics ; 39(9)2023 09 02.
Article En | MEDLINE | ID: mdl-37672035

MOTIVATION: Since the development of single-cell RNA sequencing (scRNA-seq) technologies, clustering analysis of single-cell gene expression data has been an essential tool for distinguishing cell types and identifying novel cell types. Even though many methods have been available for scRNA-seq clustering analysis, the majority of them are constrained by the requirement on predetermined cluster numbers or the dependence on selected initial cluster assignment. RESULTS: In this article, we propose an adaptive embedding and clustering method named scAce, which constructs a variational autoencoder to simultaneously learn cell embeddings and cluster assignments. In the scAce method, we develop an adaptive cluster merging approach which achieves improved clustering results without the need to estimate the number of clusters in advance. In addition, scAce provides an option to perform clustering enhancement, which can update and enhance cluster assignments based on previous clustering results from other methods. Based on computational analysis of both simulated and real datasets, we demonstrate that scAce outperforms state-of-the-art clustering methods for scRNA-seq data, and achieves better clustering accuracy and robustness. AVAILABILITY AND IMPLEMENTATION: The scAce package is implemented in python 3.8 and is freely available from https://github.com/sldyns/scAce.


Cluster Analysis , Gene Expression , Sequence Analysis, RNA
10.
bioRxiv ; 2023 Aug 02.
Article En | MEDLINE | ID: mdl-37577610

Chlamydia, an obligate intracellular bacterial pathogen, has a unique developmental cycle involving the differentiation of invading elementary bodies (EBs) to noninfectious reticulate bodies (RBs), replication of RBs, and redifferentiation of RBs into progeny EBs. Progression of this cycle is regulated by three sigma factors, which direct the RNA polymerase to their respective target gene promoters. We hypothesized that the Chlamydia-specific transcriptional regulator GrgA, previously shown to activate σ66 and σ28, plays an essential role in chlamydial development and growth. To test this hypothesis, we applied a novel genetic tool known as dependence on plasmid-mediated expression (DOPE) to create Chlamydia trachomatis with conditional GrgA-deficiency. We show that GrgA-deficient C. trachomatis RBs have a growth rate that is approximately half of the normal rate and fail to transition into progeny EBs. In addition, GrgA-deficient C. trachomatis fail to maintain its virulence plasmid. Results of RNA-seq analysis indicate that GrgA promotes RB growth by optimizing tRNA synthesis and expression of nutrient-acquisition genes, while it enables RB-to-EB conversion by facilitating the expression of a histone and outer membrane proteins required for EB morphogenesis. GrgA also regulates numerous other late genes required for host cell exit and subsequent EB invasion into host cells. Importantly, GrgA stimulates the expression of σ54, the third and last sigma factor, and its activator AtoC, and thereby indirectly upregulating the expression of σ54-dependent genes. In conclusion, our work demonstrates that GrgA is a master transcriptional regulator in Chlamydia and plays multiple essential roles in chlamydial pathogenicity.

11.
Brief Bioinform ; 24(5)2023 09 20.
Article En | MEDLINE | ID: mdl-37598422

The advent of single-cell RNA sequencing (scRNA-seq) technologies has enabled gene expression profiling at the single-cell resolution, thereby enabling the quantification and comparison of transcriptional variability among individual cells. Although alterations in transcriptional variability have been observed in various biological states, statistical methods for quantifying and testing differential variability between groups of cells are still lacking. To identify the best practices in differential variability analysis of single-cell gene expression data, we propose and compare 12 statistical pipelines using different combinations of methods for normalization, feature selection, dimensionality reduction and variability calculation. Using high-quality synthetic scRNA-seq datasets, we benchmarked the proposed pipelines and found that the most powerful and accurate pipeline performs simple library size normalization, retains all genes in analysis and uses denSNE-based distances to cluster medoids as the variability measure. By applying this pipeline to scRNA-seq datasets of COVID-19 and autism patients, we have identified cellular variability changes between patients with different severity status or between patients and healthy controls.


COVID-19 , Humans , COVID-19/genetics , Gene Expression Profiling/methods , Gene Expression , Sequence Analysis, RNA/methods , Cluster Analysis
13.
Brief Bioinform ; 24(1)2023 01 19.
Article En | MEDLINE | ID: mdl-36410733

Spatially resolved transcriptomics technologies enable the measurement of transcriptome information while retaining the spatial context at the regional, cellular or sub-cellular level. While previous computational methods have relied on gene expression information alone for clustering single-cell populations, more recent methods have begun to leverage spatial location and histology information to improve cell clustering and cell-type identification. In this study, using seven semi-synthetic datasets with real spatial locations, simulated gene expression and histology images as well as ground truth cell-type labels, we evaluate 15 clustering methods based on clustering accuracy, robustness to data variation and input parameters, computational efficiency, and software usability. Our analysis demonstrates that even though incorporating the additional spatial and histology information leads to increased accuracy in some datasets, it does not consistently improve clustering compared with using only gene expression data. Our results indicate that for the clustering of spatial transcriptomics data, there are still opportunities to enhance the overall accuracy and robustness by improving information extraction and feature selection from spatial and histology data.


Benchmarking , Transcriptome , Gene Expression Profiling/methods , Software , Cluster Analysis
15.
J Comput Biol ; 29(11): 1233-1236, 2022 11.
Article En | MEDLINE | ID: mdl-35920848

Data integration is a critical step in the analysis of multiple single-cell RNA sequencing samples to account for heterogeneity due to both biological and technical variability. scINSIGHT is a new integration method for single-cell gene expression data, and can effectively use the information of biological condition to improve the integration of multiple single-cell samples. scINSIGHT is based on a novel non-negative matrix factorization model that learns common and condition-specific gene modules in samples from different biological or experimental conditions. Using these gene modules, scINSIGHT can further identify cellular identities and active biological processes in different cell types or conditions. Here we introduce the installation and main functionality of the scINSIGHT R package, including how to preprocess the data, apply the scINSIGHT algorithm, and analyze the output.


Gene Expression Profiling , Single-Cell Analysis , Sequence Analysis, RNA/methods , Single-Cell Analysis/methods , RNA-Seq , Gene Expression Profiling/methods , Exome Sequencing , Algorithms , Cluster Analysis
17.
Genome Biol ; 23(1): 82, 2022 03 21.
Article En | MEDLINE | ID: mdl-35313930

The increasing number of scRNA-seq data emphasizes the need for integrative analysis to interpret similarities and differences between single-cell samples. Although different batch effect removal methods have been developed, none are suitable for heterogeneous single-cell samples coming from multiple biological conditions. We propose a method, scINSIGHT, to learn coordinated gene expression patterns that are common among, or specific to, different biological conditions, and identify cellular identities and processes across single-cell samples. We compare scINSIGHT with state-of-the-art methods using simulated and real data, which demonstrate its improved performance. Our results show the applicability of scINSIGHT in diverse biomedical and clinical problems.


Algorithms , Single-Cell Analysis , Gene Expression , Gene Expression Profiling/methods , Sequence Analysis, RNA/methods , Single-Cell Analysis/methods , Exome Sequencing
18.
Bioinformatics ; 38(9): 2639-2641, 2022 04 28.
Article En | MEDLINE | ID: mdl-35238346

MOTIVATION: Single-cell RNA sequencing technologies facilitate the characterization of transcriptomic landscapes in diverse species, tissues and cell types with unprecedented molecular resolution. In order to better understand animal development, physiology, and pathology, unsupervised clustering analysis is often used to identify relevant cell populations. Although considerable progress has been made in terms of clustering algorithms in recent years, it remains challenging to evaluate the quality of the inferred single-cell clusters, which can greatly impact downstream analysis and interpretation. RESULTS: We propose a bioinformatics tool named Phitest to analyze the homogeneity of single-cell populations. Phitest is able to distinguish between homogeneous and heterogeneous cell populations, providing an objective and automatic method to optimize the performance of single-cell clustering analysis. AVAILABILITY AND IMPLEMENTATION: The PhitestR package is freely available on both Github (https://github.com/Vivianstats/PhitestR) and the Comprehensive R Archive Network (CRAN). There is no new genomic data associated with this article. Published data used in the analysis are described in detail in the Supplementary Data. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Single-Cell Analysis , Software , Animals , Cluster Analysis , Algorithms , Transcriptome
19.
J Comput Biol ; 29(1): 23-26, 2022 01.
Article En | MEDLINE | ID: mdl-35020490

scDesign2 is a transparent simulator that generates high-fidelity single-cell gene expression count data with gene correlations captured. This article shows how to download and install the scDesign2 R package, how to fit probabilistic models (one per cell type) to real data and simulate synthetic data from the fitted models, and how to use scDesign2 to guide experimental design and benchmark computational methods. Finally, a note is given about cell clustering as a preprocessing step before model fitting and data simulation.


Gene Expression Profiling/statistics & numerical data , Single-Cell Analysis/statistics & numerical data , Software , Algorithms , Animals , Cluster Analysis , Computational Biology , Computer Simulation , Databases, Nucleic Acid/statistics & numerical data , Gene Expression , Mice , Models, Statistical , RNA-Seq/statistics & numerical data
20.
Sci Rep ; 11(1): 23522, 2021 12 07.
Article En | MEDLINE | ID: mdl-34876638

Many computational pipelines exist for the detection of differentially expressed genes. However, computational pipelines for functional gene detection rarely exist. We developed a new computational pipeline for functional gene identification from transcriptome profiling data. Key features of the pipeline include batch effect correction, clustering optimization by gap statistics, gene ontology analysis of clustered genes, and literature analysis for functional gene discovery. By leveraging this pipeline on RNA-seq datasets from two mouse retinal development studies, we identified 7 candidate genes involved in the formation of the photoreceptor outer segment. The expression of top three candidate genes (Pde8b, Laptm4b, and Nr1h4) in the outer segment of the developing mouse retina were experimentally validated by immunohistochemical analysis. This computational pipeline can accurately predict novel functional gene for a specific biological process, e.g., development of the outer segment and synapses of the photoreceptor cells in the mouse retina. This pipeline can also be useful to discover functional genes for other biological processes and in other organs and tissues.


Gene Expression Profiling/methods , Gene Regulatory Networks/genetics , Transcriptome/genetics , Animals , Cluster Analysis , Computational Biology/methods , Gene Ontology , Genetic Association Studies/methods , Mice , RNA-Seq/methods , Retina/physiology , Sequence Analysis, RNA/methods
...